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Abstract 

We present a new type of monotone submodular functions: multi-peak submodular functions. Roughly 
speaking, given a family of sets J-, we construct a monotone submodular function / with a high value 
f{S) for every set S £ J- {a. "peak"), and a low value on every set that does not intersect significantly 
any set in J^. 

We use this construction to show that a better than (1 — ^)-approximation (~ 0.816) for welfare 
maximization in combinatorial auctions with submodular valuations is (1) impossible in the communica- 
tion model, (2) NP-hard in the computational model where valuations are given explicitly. Establishing 
a constant approximation hardness for this problem in the communication model was a long-standing 
open question. The valuations we construct for the hardness result in the computational model depend 
only on a constant number of items, and hence the result holds even if the players can answer arbitrary 
queries about their valuation, including demand queries. 

We also study two other related problems that received some attention recently: max-min allocation 
(for which we also get hardness of (1 — ^ + e)-approximation, in both models), and combinatorial public 
projects (for which we prove hardness of (| + e)-approximation in the communication model, and hardness 
of (1 — i -f- e)-approximation in the computational model, using constant size valuations). 

1 Introduction 

In a combinatorial auction there are k players and a set of M items {\M\ = m). Each players i has a 
valuation function Vi : 2*^ — >■ In this paper we use the standard assumption that the valuation functions 
are non-decreasing and that 1)^(0) = 0. The goal is to find an allocation of the items (S'l, . . . ,Sk) to maximize 
the social welfare T,iVi{Si). There is extensive literature on combinatorial auctions, and the two most popular 
lines of research in this area are the design of computationally efficient truthful mechanisms for combinatorial 
auctions and the development of (non-truthful) approximation algorithms (for various restricted classes of 
valuations, such as subadditive [131 [IH] or submodular [HKTB]). This paper belongs to the latter type. 

Specifically, we consider the case where the valuation functions are submodular, and ask what is the best 
approximation of the optimum social welfare that is achievable in polynomial time. The case of players with 
submodular valuations unique because of two reasons. First, submodular valuations have a natural economic 
interpretation of decreasing marginal utilities. Second, the rich literature on submodular optimization makes 
the study of combinatorial auctions with submodular valuations an ideal midway point for the exchange of 
techniques and ideas between algorithmic game theory and combinatorial optimization. 

We would like our algorithms to run in time that is polynomial in k and m. However, in a naive 
representation of the valuation function we may have to specify one number for each possible bundle, so 
the total size is exponential in m. Algorithms for combinatorial auctions usually assume that the valuations 
are represented by oracles. The algorithm is assumed to make only a polynomial number of queries to the 
oracles. Popular queries include the value query (given a set S, what is w(S')?) and the stronger demand 
oracle (given prices pi, . ■ ■ ,Pm return a bundle S € argmax w(S') — S^ggpi). The state of the art is an (1 — 1/e)- 
approximation that uses value queries only |26| (which is optimal in the value oracle model |191 122| ). and a 
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(1 — 1/e + (5)-approxiination that uses demand queries only [T^ (where ^ > is some small constant). As one 
can observe from these results, different oracles may potentially enable us to obtain different approximation 
ratios. Our goal in this paper is to prove hardness results that will hold for every oracle. 
The literature contains two approaches to deal with this issue: 

• Communication complexity: Nisan and Segal pioneered this approach [231 [2S]. Players can trans- 
mit any information and thus answer any possible oracle query, but the number of bits transmitted is 
polynomially bounded. The algorithm and the players are conservatively assumed to be computation- 
ally unbounded. 

• Constant-size valuations: This approach was introduced in 17 . Each player has a valuation 
function that depends non-trivially only on a constant number of items, and its full description is 
available to the algorithm (so any query whose answer depends only on the valuation can be answered 
in constant time - see a discussion in |16|). The algorithm is required to run in polynomial time. 

The two approaches are conceptually at two opposite ends of the spectrum: either arbitrarily complex 
valuations and infinite computational power, limited only by the communication bottleneck, or extremely 
succinct valuations and a polynomial-time algorithm. 

1.1 Our Main Result: The Hardness of Welfare Maximization 

We prove the following hardness result for welfare maximization, with the same factor in both models. 

Theorem 1.1. For welfare maximization with submodular valuations, a (1 — ^ + e)- approximation for any 
constant e > requires exponential communication. For constant-size submodular valuations, a (1 ~ ^ + s)" 
approximation would imply P — NP. 

We note that previously, only (1 — o(l))-hardness was known in the communication complexity model, 
and we show the first constant-factor hardness in this model (this question was open since the seminal 
paper of Nisan and Segal [15]). In particular, this settles affirmatively a conjecture of [TD]. NP-hardness of 
(1 — p)-approximation for some very small p > was known for constant-size submodular valuations [16]. 

Each of our results implies hardness of (1 — ^ + e)-approximation in the demand oracle model, because de- 
mand queries can be simulated in polynomial communication, and because demand queries can be efficiently 
implemented for constant-size valuations. This improves the known NP-hardness of ( y| -|-e)-approximation in 
the demand oracle model by Chakrabarty and Goel [5] . We stress that our results apply to any "reasonable" 
oracle model and not just to the demand oracle model; see also the discussion in [16j . 

We remark that the communication complexity result holds even for a constant number of players, in the 
sense that for every e > there is some fixed k > 2 such that the hardness result holds for k players. For 
the special case of 2 players, we get the following. 

Theorem 1.2. For welfare maximization with 2 players with submodular valuations, a (j^+e)- approximation 
for any e > requires exponential communication. 

For constant-size valuations, we clearly need a superconstant number of players, otherwise the problem 
can be solved in constant time. 

1.2 Our Techniques 

At the heart of our hardness proofs is a new construction of submodular functions, which has been the 
stumbling block of previous attempts to prove similar results. We call these multiple-peak submodular func- 
tions: given a family of (possibly overlapping, exponentially many) sets J-, we want a monotone submodular 
function with high value f{S) for every set S E J- {a "peak"), and a low value on every set that does not 
intersect significantly any set in J-'. The precise parameters depend on the family in particular how much 
the sets in J- are allowed to overlap. As an aside, this construction goes in the opposite direction to the 
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Figure 1: Summary of our hardness results: "CC hardness" refers to communication complexity, or more precisely 
inapproximability under polynomial communication, and "NP-hardness" refers to inapproximability for constant-size 
valuation functions. The approximation results shown here are in the demand oracle model for submodular welfare 
[17| . in the value oracle model for submodular max-min allocation (*for a constant number of players) [9], and in the 
value oracle model for submodular CPP In the value oracle model, 1 — 1/e is optimal for all 3 problems. 



(very different) construction of Balcan and Harvey pi, where a submodular function is constructed that has 
very low value on a given family of sets. 

This construction is then leveraged in two possible ways: for communication complexity, we use the 
general framework introduced by Nisan [23]. We use the probabilistic method to prove the existence of 
an exponentially large family F and show a reduction from the Set Disjointness problem. Each player has 
a multiple-peak submodular valuation corresponding to a family J-i, where either there is a choice of sets 
Si G J-i that are "consistent" (disjoint/equal, depending on the problem), or not - distinguishing these two 
cases requires exponential communication. 

For NP-hardness results, we start from Feige's proof of inapproximability of Max fc-cover |14j . Then we 
use the coverage system to construct an instance with multi-peak submodular functions. In the YES case, 
the instance contains matching sets, providing high value for each player, while in the NO case, some players 
are forced to receive low value. In addition, Feige's construction guarantees that the valuation functions 
depend only on a constant (albeit large) number of items. Hence the computational hardness result applies 
even if the bidders have constant-size valuations. 

1.3 Other Results 

We use the new multiple-peak submodular functions to prove hardness results for every oracle in two other 
related settings. One is Combinatorial Public Projects (CPP), where we seek a single set S of at most k 
items, maximizing '^ii^)- Another one is the Max-min Allocation Problem, where we want to allocated 

disjoint sets of items Si in order to maximize mini<i<j. Vi{Si). 

Combinatorial Public Projects. For CPP there is an (1 — l/e)-approximation that can be achieved 
using value queries only |23| . We show that this is optimal in any arbitrarily powerful oracle model even for 
constant size valuation, hence strengthening previous result that showed that this result cannot be improved 
using demand queries [6]. We note that the situation is different for the welfare maximization, where 
1 — 1/e is optimal for value queries but not optimal if demand queries are allowed |17j . No constant-factor 
communication complexity hardness was previously known for the CPP problem. 

Theorem 1.3. For the CPP problem with submodular valuations, a (| t)- approximation for any e > 
requires exponential communication. For 2 players with submodular valuations, a (| -|- e)- approximation for 
any e > requires exponential communication. For constant-size submodular valuations, a (1 — i + e)- 
approximation would imply P — NP. 

M£Lx-min Allocation. Finally, our results for Welfare Maximization also apply to the Submodular Max- 
min Allocation problem. The problem of obtaining a max-min allocation was quite heavily studied recently 
[5J [31 [121 [H ni , and most of the work concentrated on cases where the valuations are special cases of sub- 
modular valuations. It is known that the Submodular Max-min Allocation problem admits a (1 — 1/e — e)- 
approximation for any fixed e > and any fixed number of players k [9], and this is optimal in the value 
oracle model, if we want approximation independent of k [22\ . Our hardness results also hold for a fixed 
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number of players k (approaching 1 — ^ for k fixed but large), and they are independent of a particular 
oracle model. When the number of players is not fixed, the problem is less understood, and the best known 
approximation is 0(to^/^+'^) [T8l [7]. 

Theorem 1.4. For Max-min Allocation with submodular valuations, a (1 — j^+e)- approximation for any e > 
requires exponential communication. For constant-size submodular valuations, a (1 — ^ + e) -approximation 
would imply P — NP. For Max-min Allocation with 2 players with submodular valuations, a (y| + e)- 
approximation for any e > requires exponential communication. 

The proof is obtained by a straightforward adaption of the proof for Welfare Maximization, as we explain 
in Section l473l 

Organization. The rest of the paper is organized as follows. In Section [2l we give the necessary prelimi- 
naries. In Section [31 we present the construction of multiple-peak submodular functions, the main technical 
ingredient of our proofs. In Section 21 we present the proofs for Welfare Maximization, and we also note why 
they apply to Max-min Allocation. In Section |B1 we present the proofs for Combinatorial Public Projects. 



2 Preliminaries 

A function / : {0, 1}" ^ R is submodular if f{A) + f{B) > f{A U B) /(A n B) for all A,B (and 
sets are naturally identified with {0, 1} vectors). By calling a function monotone, we mean non-decreasing, 
i.e. f{A) < f{B) for all ACS. We use the following connection between continuous and discrete submodular 
functions (slightly modifying the conditions used in [20] and [27] , where "smooth submodular functions" were 
used). 

Definition 2.1. We call a function F : [0, 1]" — >■ M continuous submodular, if all of the following conditions 
hold: 

• F is absolutely continuou^ on every line segment of direction Bi, for every i. 

• The partial derivative is defined almost everywhere on every line segment of direction e^ . 

• is non-increasing with respect to every coordinate. 

Lemma 2.2. If F : [0, 1]" — > M zs continuous submodular, then its restriction to {0, 1}" is submodular. 

Proof. Since F is absolutely continuous, we can express differences between point values by integrating the 
partial derivatives: assuming that x e {0, 1}" and Xi = Xj = 0, 



lo dxi 



dt. 



F(x + eO - ^^(x) = / 
Jo 

Since is non-increasing with respect to every coordinate, in particular Xj, we get 

F{x + e^ + Bj) - F(x + ej) < F(x + e,) - F(x). 
This means that F restricted to {0, 1}" is submodular. □ 

In this paper, (x)_|_ = max{x,0} denotes the positive part of a number. We remark that when we write 
{x)\, we mean (maxja::, 0})^, i.e. for a; < this quantity is 0. 



function _F : R — ^ R is absolutely continuous if for every e > there is <5 > such that whenever \xi — yi\ < 5, 

T.,\F{=^i)-F{y^)\<^■ 
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3 Construction of Multiple-Peak Submodular Valuations 



In this section, we construct a new class of submodular functions. These functions are the main technical 
elements of our hardness results. Given a family of sets we would like to define a function that has a high 
value on each set in T ^ and low value on every set that does not overlap any set in T very much. We call 
these functions multiple-peak submodular valuations. 

The construction is inspired by the "continuous point of view" , which has been used in previous work 
on submodular optimization |22i I26[ 127] , and the connection between continuous and discrete submodular 
functions given by Lemma [2^ Suppose we start from a continuous submodular function F : [0, 1]*''^ -> R4. 
that depends only on the sum of coordinates ^x^, in particular -F'(x) = 1 — (1 — a^Xi)^. In addition, we 
have a collection T of vertices of the hypercube that should become the "peaks" of the function, i.e. their 
value should be increased compared to -F(x). We define a region of the hypercube Ba C [0, 1]*^ around each 
vertex A G such that these regions are disjoint. Then if we modify the function F so that it is continuous 
submodular on each region separately, and first partial derivatives are continuous on the boundary of each 
region, then the resulting function is still continuous submodular. Crucially, we can perform this operation 
on each region Ba independently, since there is no interaction between different (disjoint) regions, due to the 
local nature of continuous submodularity - we only have to ensure that first partial derivatives are continuous 
on the boundary of each region. 

In particular, consider a region defined by Ba = {x G [0, 1]^^ : J2ieA ~ ^i<^A > b}- We can define 
Fa(x) = 1 — (1 — a(2X]ieA^* ~ ^))+(l ~ '^(2 X^j^A -^^ + ^))+- This function is continuous submodular on 
Ba- Moreover, it agrees with F(x) on the boundary of Ba and also its first partial derivatives are the same. 
Therefore, we can glue the two functions seamlessly at the boundary of Ba and the resulting function is still 
continuous submodular. 

A discrete formulation of this idea follows. 

Definition 3.1. A set S Q M is said to he &-close to a set A, if \S f\ A\ — \S \ A\ > b. More generally, a 
point X e [0, 1]*^ is &-close to A, if X^igA ~ Tli^A ^■ 

Note that S being b-close to A is the same as the characteristic vector I5 being b-close to A. 

Proposition 3.2. Let A, A' C M be two sets such that \Ar\A'\ < b. Then every point x G [0, l]'*^ is &-close 
to at most one of the two sets A, A' . In particular, every set S <^ M is fe-close to at most one of A, A' . 

Proof. Suppose towards contradiction that x is &-close to both A and A'. That is, XieA^* ~ Xi^A -^i ^ 
and XieA' -^^ ~ Xi^A' ^ ^- Adding up these two inequalities, we get 

2b < i^xi - y^xi) + (y^ - xi) 

i£A ii^A ieA' i^A' 

= Xj - Xt + - y^xt) + (y^ x— Xj + - 

ieA i^A ieA ieA ieA' i^A' ieA' ieA' 

= (2 ^ Xi - ^ Xi) + (2 ^ Xi - ^ Xi) 

ieA i ieA i 

— 2 ^ I 2 ^ 2 ^ 

ieA ieA' i 

= 2 ^ Xi-l-2 ^ x,-2^x^<2\AnA'\<2b 

ieAnA' ieAuA' i 

We have reached a contradiction and therefore x is close to at most one of the sets. □ 

Definition 3.3. A family of sets T C 2*^ is called 6-intersecting if\Ar\A'\ <b for all A, A' e T, A ^ A' . 

Definition 3.4 (Multiple-Peak Submodular Valuation). Let T be a b-intersecting family and a > some 
constant. A function f is called (T, a, b) -multiple-peak function if it is defined as follows: 
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1. For a set S that is b-close to some A E J- (unique by the above), 



f{S) = 1 - (1 - a{2\S nA\^ 6))+(l - a{2\S \ A\ + b))+. 

2. For a set S that is not b-close to any A J-, 

fiS) = l-il-a\S\)l. 

Lemma 3.5. For every constant a > and every b-intersecting family T , the [J- , a, b) -multiple-peak function 
f (as defined in Definition \S.4^ is monotone and submodular. 

Proof. We define the following function F : [0,1 



F{^) = l-{l-aY,Xi) 



One can verify that this function is continuous submodular, because it is absolutely continuous and its partial 
derivatives are non-increasing. In addition, for each A G J^, we define 



Fa(x) = 1 - (1 - a(2 ^ X, - b))+{l - a{2 ^ a;, + 6))- 



Again, it is easy to see that Fa is continuous submodular, by checking that its partial derivatives are 
non-increasing. 

Define Ba = {x G [0, 1]*^ : X^ieA^' ~ Si^A^* > region 6-close to A. We know that these 

regions are disjoint for A E J-. We define a function F(x) to be equal to ^a(x) for x £ Ba, and equal to 
F(x) when x ^ Ba for any Ae T. 

We need to prove that F is a continuous submodular function. Since F and Fa are continuous submodular 
functions, we only have to verify that they agree on the boundary of Ba up to first partial derivatives. Note 
that the boundary of Ba intersects any axis-parallel line segment in at most one point. Therefore this will 
imply that on any line segment parallel to e.;, F is absolutely continuous, is defined almost everywhere, 

and it is non-increasing with respect to every coordinate. Therefore, F is continuous submodular. 

For each boundary point x of Ba, we have 2 X^ieA Xi — b — 2 X^i^yi Xi -\-b — Xi. Therefore, -F!a(x) = 
F(x). In addition, if 1 — a^^Xi < 0, then F{x.) — Fa{x) = 1 and all partial derivatives are 0. Finally, 
assume that 1 — aj^i^i > 0- For j £ A we have 

dF A OF 
— i = 2a(l-a(2^x, +5)) = 2a(l-a^a;,) = — . 

Similarly, for j ^ A, we have 

dF A dF 
— ^ = 2a(l - a(2 Vx, - b)) ^ 2a(l - a Vx^) = — . 
ax^ axj 

■' i&A I ■' 

Hence F and Fa agree on the boundary of Ba up to their first partial derivatives. 

Therefore, F is a continuous submodular function and also monotone (non-decreasing). By Lemma [2.2[ its 
restriction to {0, 1}^"^ is monotone submodular. This is the multi-peak submodular function of Definition l3.4l 

□ 

4 Welfare Maximization 

In this section we prove our three results regarding welfare maximization: for every constant e > 0, a 
(1 — ^ + e)-approximation requires exponential communication, for 2 players, a y| -I- e requires exponential 
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communication and that for constant-size submodular valuations, a (1 — ^ + e)-approximation would imply 
P = NP. We then note why the same proof implies hardness result for max-min allocations fTheorem ll.4p . 
All results share the same basic ideas and use multi-peak submodular functions. The multi-peak submodular 
functions are defined using a special kind of a set system: 

Definition 4.1. A collection of sets S C 2*^ that is partitioned into groups Si, . . . ,Sk is called well struc- 
tured if all of the following holds: 

• The total number of elements in the universe is m = k ■ s. 

• There is some number s such that for each set S £ S, \S\ — s. 

• = . . . = |5a;|. 

• There exists some b such that for every i G [k] and every S, S" G Si, we have \S H S'\ < 6. 

Given a well structured collection, we identify the k groups of sets with k players. The set of items is 
M. For each player i, we define a a, 6)-multi-peak submodular valuation function Vi : 2^^ — ^ M+. The 
precise choice of a, b is application specific and will be given later. 

We will say that an instance is a YES instance if there exist k disjoint sets, one from each group Si, 
whose union is M . An instance is a NO instance if for any choice of ^ < fc sets, their union covers at most a 
(1 — (1 — 1/kY + e)m items. 

The inapproximability gap that we prove is the result of the intractability of distinguishing between YES 
instances and NO instances. We start with calculating the welfare in YES instances. The more involved 
task is to upper bound the welfare in NO instances. 

Claim 4.2. The optimal welfare in a YES instance is at least k{l — (1 — a(2s — + — a6)+). 

Proof. This is a YES instance so for each bidder i there is a set Si G Si such that the S'i's are disjoint 
and cover the universe. A feasible allocation gives Si to player i, and each player collects value Vi{Si) — 
1 — (1 — a{2s — — a6)+. So the overall welfare of this allocation is fc(l — (1 — a{2s — — ab)+). □ 

Consider now a NO instance. There is no such choice of disjoint sets as in YES instances, and moreover 
any selection of £ sets such that Ai G Si satisfies | lJi=i — (1 ~ (1 ~ 1/^)^ + ^)ks. Consider any feasible 
allocation of disjoint sets {Si, . . . ,Sk)- For each Si, let Ai G Si be the set in Si closest to Si, i.e. maximizing 
S'i n — \Si\Ai\. Our goal now is to upper bound the maximum welfare such allocation can achieve. 

For each i, we define variables Xi — ^\Sir\Ai\,yi = ^\Si\Ai\. Without loss of generality, xi > X2 > x^ > 
. . . > Xk- For every £, the first £ variables Xi satisfy the constraint that Yli=i — (1 — (1 — (arising 
from the properties of the set system in NO instances). In addition, we have the constraint 
meaning that we cannot allocate more than the total number of elements, ks. The following technical lemma 
bounds the value that can be obtained under these constraints. (We note that the parameters a, P in the 
lemma are related to a, 6 by a = a • s and j3 = b/s.) 

Lemma 4.3. Let xi, yi, . . . , Xk, yk ^ be such that 

• Xi> X2> ■ ■ ■> Xk, 

• for each £<k, < {1 - {I - l/kf + e)k, 

• Y.\=l{^^+y^) < fc- 

Let v{x, y) = 1- (1 — Q:(2a;-/3))+(l — a(2y + ;5))+ if x~ P > y, and v{x, y) = 1- {l-a{x + y))'^ if x~ P < y. 
Then, 

Y^v{x,,y,) < ^max^2afc + 1 - ^-^{{1 - l/k f - ef 
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Moreover, for k = 2 and P > j + e we have that 

^«(x„y.) = 2-(l-«(2-i±^))^. 

i—l 

We defer the proof to Appendix |^ Given this lemma, we obtain bounds on the ratio between YES and 
NO instances. 

Corollary 4.4. The ratio between the social welfare in a NO instance and a YES instance is: 

• 1 — ^ + 0(e) for a ~ ^, (3 — e and large enough k. 

• tI + /or a = |, /3 = i + 2e and k ^ 2. 

Proof. We show the first part by optimizing for the best value of k* in the NO case: Let z — k* jk. For large 
fc, we can approximate (1 — XjkY^ ~ e^^^. Hence we maximize the function 0(z) = k — 
z € (0, 1) (the rest of the terms approach and we ignore them in this analysis). By elementary calculus, 
(f>{z) is maximized at z = 1/2, and the maximum value is (/)(|) = k{l — ^). In other words, k* — k/2 is the 
best choice, and gives 

^«(x„yO = (1 - e-i/2)fc + ifc(l - (1 - e-'/'f) = (1 - yjk. 

i—l 

In the YES case, the optimum is fc(l - (1 - a(2 - /3)) + (l - q:/3)+) = fc(l - ie(l - ^e)) > k{l-e). Therefore, 
the ratio is 1 — ^ + 0(e). 

For the second part, we choose a — ^ and /3 = ^ + 2e, which gives an optimal solution in the NO case 
xi = 1 + e, 7/1 = 0, a;2 = 0, 2/2 = 1 — e. The value of this solution is 

^7;(x„2/0 = l + l-(l-«2/2)^=2-(l-^(l-e))2^=2-(l + ^e)2<^. 

In the YES case, as we argued, the optimum is 2(1 - (1 - a{2 ~ /3))+(l - al3) + ) = 2(1 - (1 - |(2 - i - 
2e))(l - |(i + 2e)) = 2(1 - ie(| - ^e)) > 2 - e. Therefore, the ratio is + 0(e). □ 

4.1 Communication complexity of welfare maximization 

Here we prove the first part of Theorem II. 1[ and Theorem 11.21 for every constant e > 0, a (1 — 2^ + 
e)-approximation requires exponential communication. For 2 players, a (y| + e)-approximation requires 
exponential communication. This is the first constant-factor impossibility result for submodular valuations 
in the communication model. The basic idea is to reduce from the Set Disjointness problem. In the Set 
Disjointness problem there are k players, each player i holds a string a;' G {0, 1}*. The following theorem 
establishes the hardness of Set Disjointness. 

Theorem 4.5 (|1]). It requires bits to distinguish between the following two cases: 

1. There exists some 7 such that x\ — . . . — x^: = 1. 

2. For each two different players i and i' and every j it holds that if x'j = 1 then x'j = 0. 

The lower bound holds also for randomized and non- deterministic algorithms. 

The plan is to reduce Set Disjointness (with an exponential t) to an instance of combinatorial auctions 
with multiple-peak submodular functions. Then we would like to show that a good approximation algorithm 
will let us decide the Set Disjointness problem. This will show that a good approximation algorithm requires 
exponential communication. The first step is to show the existence of an exponential well-structured set 
system. 
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Lemma 4.6. For a universe M, \M\ = ks = m and for any e > 0, there is a collection of t ^ 2®^"^ '"/'^ ^ 
partitions — (C^, . . . ^Cj.) such that 

• For each j e [t], ULi = M. 

• For each i £ [k],j G [t], |Cf | = s. 

• For any i < i' and any H C^, | < ^-^s. 

• For every distinct ii, ...ii and any ji, . . . ,je, we have \C(^ U . . . U Cf^ | < (1 — (1 — + e)ks. 

Proof. This is a slight modification of the probabilistic construction of . Consider M partitioned into s 
fc-tuples (e^, . . . , e^), 1 < r < s. We generate each partition as follows: from each fc-tuple (e^, . . . , ep, 
we include each element in exacly one of the sets , . . . , , using an independently random bijection 
{el, . . . , e|J} — > {Cf, . . . ,Cl}. Thus the first two conditions are satisfied by construction. 

Consider any pair Cf , C^, for i ^ i' . Each set is generated by taking (independently) a random element 
from each /c-tuple (e^^, . . . , eJJ). Therefore, the probability of taking the same element is for each fc-tuple 
and these events are independent. The expected size of the intersection |C/ nC^, | is s/fc, and by the ChernofI 
bound, the probability that the intersection is larger than Pr[|Cf n Cl, | > (1 + e)s/fc] < e^^^'^ ^1^) ^ 

Similarly, consider any choice of distinct (ii, . . . , i^) and any (ji, . . . Each element appears in Cf^ U 
...UCil^ with probability 1 — (1 — l//c)^, and hence the expected cardinality of the union is (1 — (1 — l/fc)^)fcs. 
We can write \Cf^ U . . . U C|^' | — X]r=i -^r where Xr is the number of elements of the r-th /c-tuplc that are 
contained in the union. The random variables Xr are independent, in the range [0, fc], and hence by the 
Chernoff bound Pr[J2UiXr > (1 + e)EE'=i ^r]] < e-^^'''/''\ 

If the number of partitions is t, we have 0{t^) combinations of sets to consider. By the union bound, the 
probability that any of the bad events above occurs is 0(t'^)e~^^^ ^/f^) _ For t — 2®^^ \ the probability is 
smaller than 1 and hence there is a collection of partitions satisfying the assumptions. □ 

Given an instance of Set Disjointness, consider the following well structured collection of sets: We con- 
struct Cf using Lemma 14.61 and we include Cf in Si if and only if a;* = 1 . Consider the k multiple-peak 
submodular functions that are based on this well structured collection of sets (with b = ^^s). Observe that 
if there exists some j such that = . . . = = 1 then this is a YES instance. However, if for each two 

different players i and i' and every j it holds that if — 1 then x'j = this is a NO instance. Hence it 
requires exponential amount of communication to distinguish between these two cases (even for 2 players) 
and the hardness of approximation in the communication model follows from CoroUarv 14.41 

4.2 Computational complexity of welfare maximization 

We now prove the second part of Theorem ll.il The starting point of our reduction is Feige's inpproximability 
result for Max fc-cover [H]. We summarize it here, with some additional useful properties of the set system 
that arises from Feige's reduction (see also [T3] and [IB] for some comments and explanations). 

Hardness of Melx fc-cover. For any fixed e > 0, it is NP-hard to distinguish the following two cases for 
a given collection of sets S C 2*^, partitioned into groups Si, . . . ,Sk: 

1. YES case: There exist k disjoint sets, one from each group Si, whose union is the universe M . 

2. NO case: For any choice of £ < k sets, their union covers at most a (1 — (1 — 1 / kY + e) -fraction of M . 
In addition, the set system can be assumed to have the following properties: 

• Every set has the same (constant) size s. 
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• Each group contains the same (constant) number of sets g. 

• Every element appears in the same (constant) number of sets d. 

• Any two sets intersect in at most es elements. 

We remark that the properties up to the last two are discussed in [T51 [TB] . The last two properties follow 
from the deterministic construction for max fc-cover in [14j . as follows: 

The sets are unions of (L — l)-dimensional layers in disjoint hypercubes [k']^, where k' is the number of 
provers, and each hypercube corresponds to a particular random string r generated by the verifier. (Here, 
k' and L are both constant.) Each set corresponds to a triple (q,a,i) where g is a question, a is an answer, 
and I is a prover. The direction of the layer that a set uses in a certain hypercube corresponds to the answer 
a, and the position (shift) of the layer corresponds to the prover i. Given a random string r and a prover 
i, the question q is determined by r and i. Therefore different sets for the same prover i, restricted to the 
same hypercube r, must differ in their answer a. Hence, they are orthogonal in the respective hypercube. In 
particular, for each prover i there can be at most L different sets participating in the same hypercube, one 
for each dimension. As the number of provers and L are constant, there is only a constant number of sets 
participating in each hypercube. As shown in |15) . it can be arranged that the number of sets containing 
any given element is the same, and by the above it must be a constant. 

For two different provers, the respective sets restricted to one hypercube are layers indexed by a different 
i, and hence they are either orthogonal or disjoint. Either way, any two different sets are either disjoint or 
intersect in a l/fc'-fraction of their size in each hypercube where they both participate. Overall, two different 
sets can intersect in at most a l/fc'-fraction of their elements. The number of provers fc' can be chosen large 
enough so that 1/fc' < e (in any case fc' has to increase as e ^ 0, for other reasons). 

Reduction. We describe a reduction from the Max fc-cover problem to a combinatorial auction with 
submodular valuations which works as follows. Given an instance of Max fc-cover as above, we identify the fc 
groups of sets with k players. The set of items is the universe M. For each player i, we define a multiple-peak 
submodular valuation function Vi : 2*^ — > R+. The size of each set in Si is s (a constant), and the number of 
sets is g, also a constant. Hence, the set system Si covers only a constant number of elements < sg, and we 
apply the construction of Definition 13.41 only to these elements (i.e. other elements have value for player 
i). From the properties of the Max fc-cover instance, we know that any two sets in Si can overlap in at most 
es elements, hence the parameter b can be set to 6 = es. We also set a = From the hardness of Max 
fc-cover it follows now that it is NP-hard to distinguish between the YES and NO instances, with a ratio of 
optimal values 1 ^ + 0(e) due to CoroUarv 14.41 

4.3 Max-min allocation 

We note that the same proofs apply to the Max-min Allocation problem as well. This is because the 
reduction produces instances such that in the YES case, all players receive the same value. In the NO case, 
the minimum value is upper-bounded by the average value over all players. Therefore, the gap in terms of 
the objective minKi<kVi{Si) is at least as large as in terms of welfare maximization. In fact the hardness 
factors can be (slightly) improved for Max-min allocation, but we defer this to the full version of the paper. 

Acknowledgment. JV wishes to thank Uri Feige, who inspired the computational-complexity part of this 
paper in a conversation 6 years ago. We would also like to thank Ashwinkumar Badanidiyuru Varadaraja 
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A Proof of Lemma 14.31 

This section is devoted to proving Lemma We consider some optimal solution and massage it to get the 
specified bounds. The following simple claims show several structural properties that an optimal solution 
can be assumed to possess. 

Claim A.l. The sequence Xi > X2 > ■ ■ ■ > Xk is non-zero up to a certain index k* , Xi — (3 > yi for i < k* 
and then Xi — for i > k* . 

Proof. If Xi — P < yi, we can assume that Xi = (by replacing yi by Xi + yi and Xi by , nothing changes) . □ 
Claim A. 2. If Xi — Xj, i < j < k* then yi = yj = 0. 

Proof. Suppose that Xi = Xj, i < j < k* and suppose yj < yi 7^ 0. The derivative ^ at Xi equals the 
derivative at Xk- So we can move some small amount of mass from yi to yj, while making sure that still 
Xj — P > yj. Observe that the value of the solution is the same. But once yi > yj, it is already more 
profitable to increase Xi. So the case where Xi = xj is not optimal, unless yt = yj = 0. □ 

Claim A. 3. All the variables yi for i > k* are equal. We denote their value by y* . 

Proof. If not, we could gain by making them equal, due to the concavity of v{Q, y) = 1 — (1 — \y)\- □ 
Claim A. 4. For all yi < k* , yi — 0. 

Proof. Consider some yi < k* with yi > 0. Notice that Xi > Xk* (by Claim [^12] if Xi — Xk' then yi — 0), and 
hence the derivative ^ at Xi is smaller than the derivative ^ at Xk' ■ So moving some mass from yi to yt- 
improves the value of the optimal solution — a contradiction. □ 

A.l The case of 2 players 
Claim A.5. k* > 1. 

Proof. Suppose k* — 0. Then, by the previous claims we have that xi ~ X2 ^ and j/i = t/2 = 1- The value 
of this solution is 2 — 2(1 — a)"^. However the solution in which xi = 7/2 = 1 and X2 — yi — has value 
2 — (1 — a)^ ~ (1 ^ «(2 — /3))+(l — a/3)+ which is no smaller by using simple arithmetic. □ 

Claim A.6. yi = 0. 

Proof. Suppose j/i > 0. It must be that fc* = 1 (by Claim El4)) . If 1 — a{2xi — b) < then yi does not 
bring any benefit and moving all the mass of yi to ?/2 can only improve the value of the solution. Otherwise, 
moving some mass from j/i to xi (making sure that still 1 — a{2xi — 6) > 0) will improve the optimal solution 
— a contradiction. □ 

Claim A.7. If P>\ + 2e then k* = 1. 

Proof. By the constraints of the lemma we have that Xi+X2+yi+y2 ^2 and xi +X2 < 2(| + e). Without 
loss of generality, Xi+X2 + yi+y2 — 2, and hence Xi+X2^yi—y2 — 2(a;i +X2) — {xi +X2+yi+y2) < 1 + 4e. 
But if k* = 2, summing up xi — /3 > ?/i and X2 — /3 > 2/2 we get xi + X2 — j/i — 2/2 > 2/3 > 1 + 4e. This is a 
contradiction. □ 
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Claim A.8. xi = 

Proof, xi — ^^2^ is the point at which 1 — a,{^x\ — /3) =0, i.e. the first player is saturated and does not 
benefit from any additional items. If 1 — a{2xi — /3) < 0, we can move some mass from xi to ?/2 and improve 
the optimal solution. If 1 — a{2xi — /?) > 0, we can move some mass from X2 or y2 to xi and improve the 
optimal solution. □ 

This implies that the following is an optimal solution: xi — ^2 = 2 — ^3^, X2 — yi — 0. The first 

player gets value 1, and the second player 1 — (1 — ay2)\', hence the optimum is 

2 

Y^vix^y,) ^ 1 + 1 - (1 - ay2)l = 2 - {1 - a{2 - 
1=1 

This proves the second part of Lemma 14.31 



A. 2 The case of many players 

By the series of claims we proved above, we have a sequence xi > X2 > ■ ■ ■ > Xk*, yt — for i < k*, and 
then yk'+i — yk'+2 = ■ ■ ■ = Uk = y* , Xi = for i > k*. The value of this solution is at most (we upper 
bound the value of player k* by 1): 

k fc*-l k 

J2v{x.,,y,) - 5] (l-(l-a(2x. -/?))+(! -a/3)+) + l+ ^ (1 - (1 - m/)^) 

i=l i—1 i—k*-\-l 

k*-l k 

i=l i=fc*+l 

We have the global constraint J2i=i +Si=fc*+i Di — ^^^^ ^^^^ J2i=i ^ ^(1 ^ (1 ^ 1/^)'' +^)- Subject 
to these constraints, it is profitable to make the variables Xi as large as possible, because they bring more 
benefit than the yi variables (see the formula above). Therefore, we can assume that both constraints are 
tight: E£i = k{l - (1 - l/kf + e), and y^ = {k - k*)y* = ((1 - 1/k'f - e)k. Then we have 

y* = _A_((i - l/k)^* - e). The objective value is 

k k'-l 



Y^x^^Vi) ^ 2a ^ a;, + l + (fc-fc*)(l-(l-a2/*)2) 



= 2afc(l - (1 - \r + e) + 1 + (fc - fc*)(l - (1 - ^^((1 - lA)'' - ^)?) 
= 2ak + l-^^{{l~l/kr -ef. 



This proves the first part of Lemma 14.31 



B Combinatorial Public Projects 

Here, we prove Theorem II. 3[ in two parts. Due to the similarities with welfare maximization, we only point 
out the necessary differences. 
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B.l Communication complexity of CPP 

We reduce from an instance of Set Disjointness to CPP with submodular valuations as follows. We use 
Lemma [4.61 to construct a collection of partitions {Cf, . .. ,Cl),l < j < t. The difference now is that we 
use Cl for all k players, rather than Cf for player i. We include Cl in collection Si if a;* = 1 in the Set 
Disjointness instance. By Lemma |4.6[ Si is a well structured collection. The valuation Vi of player i is going 
to be the (5^, a, &)-multiple-peak submodular function of Definition 13. 4[ where a — and b — ^-^s. The 

CPP instance is going to be niax{^^^^ Vi{S) : \S\ = s}. 

In the YES case, all players share an index j such that — x'j ~ . . . = Xj = 1, and hence Cl G Si for all 
i e [fc]. Then is a feasible solution of value v,{S() = 1 - (1 - a{2s - - ab)+ = 1 - ^(1 - ^) = 

1 — 0(1/ k) for each player. 

In the NO case, for every index j there is at most one player i such that a;* = 1; hence Cf £ Si for at most 

one player i. By Lemma 221 1^(0 \ < ■^^s = b. By Proposition 13.21 any set S can be 6-close to at most 
one set Cf. Therefore, for any feasible set S, at most one player derives a value defined by a 6-close set, and 
all the remaining k—1 players get value defined by the formula Vi{S) = 1 — (1 — a|5|)^ = 1 — (1 — as)"^ = 3/4. 
One player can receive value close to 1. 

For a large number of players (fc -> oo), the value per player tends to 1 in the YES case and 3/4 in the 
NO case. Therefore, if we could approximate the CPP problem within a factor better than 3/4, we could 
solve the Set Disjointness problem. 

For 2 players, we still use the construction above, but we only consider the first two players. In the YES 
case, both players get value close to 1. In the NO case, 1 player must get value close to 3/4, while the other 
player can get value close to 1. Thus if we could approximate the CPP problem for 2 players within a factor 
better than 7/8, we could solve the Set Disjointness problem. 

B.2 Computational complexity of CPP 

Here we prove the second part of Theorem 11.31 This is actually quite simple and does not require our 
construction of multiple-peak submodular functions. The same reduction has been used in [6] to prove that 
CPP is hard to approximation better than 1 — 1/e with unit-demand valuations. The only new ingredient 
here is the observation that the valuation functions can be assumed to depend only on a constant number 
of items. The result of [5] already implies that the factor of 1 — 1/e for CPP with submodular valuations 
cannot be improved in the demand oracle model; we strengthen this to the case of constant-size valuation 
functions. 

Proof of Theorem \1.3[ part 2. Consider an instance of Max fc-cover as above. We associate players with the 
elements of the universe M, and items with the sets in S. For each player e G Af , we define a valuation 
function as follows: Ve{T) = 1 if T contains any set covering element e, and otherwise. Due to the properties 
of the set system, this valuation function depends non-trivially only on a constant number of items (d), and 
it is a very special monotone submodular function (unit demand) of the form Vi,{T) = min{|T n Del, !}■ 

The objective function of the CPP problem is now X^cga/ ^eC^) which is the number of elements covered 
by the sets in T. Subject to the condition |r| < k, this is NP-hard to maximize within a 1 — l/e-|-e factor. □ 
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